Xinyuan Chen

PARE: Pruning and Adaptive Routing for Efficient Video Generation

May 26, 2026

BEAT: Rhythm-Elastic Alignment for Agentic Music-guided Movie Trailer Generation

May 26, 2026

LamPO: A Lambda Style Policy Optimization for Reasoning Language Models

May 20, 2026

LambdaPO: A Lambda Style Policy Optimization for Reasoning Language Models

May 19, 2026

Uni-Animator: Towards Unified Visual Colorization

Feb 26, 2026

ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions

Dec 11, 2025

LIA-X: Interpretable Latent Portrait Animator

Aug 13, 2025

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Aug 10, 2025

Self-Improvement for Audio Large Language Model using Unlabeled Speech

Jul 27, 2025

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

Jun 18, 2025